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Abstract — This  study  illustrates  the  importance  of  assessing 
and  accounting  for  time  heterogeneity  in  longitudinal  social  net¬ 
work  analysis.  We  apply  the  time  heterogeneity  model  selection 
procedure  of  [1]  to  a  dataset  collected  on  social  tie  formation 
for  university  freshman  in  the  Netherlands  by  [2].  Within  the 
context  of  analyzing  selection  effects  for  smoking  homophily  to 
understand  the  implications  of  tobacco  policy  at  a  university, 
we  show  that  failing  to  account  for  time  heterogeneity  yields 
quite  different  results  substantively  from  the  model  arrived  at 
using  [1],  While  the  results  are  limited  by  the  small  scope  of 
the  dataset,  the  paper  motivates  the  testing  of  time  heterogeneity 
within  longitudinal  studies  of  social  network  behavior  and  further 
study  of  tobacco  policy  within  university  settings. 

I.  Introduction 

The  study  of  social  relationships  and  health  has  interested 
researchers  for  some  time.  [3]— [5]  study  health  outcomes 
as  embedded  in  social  support  mechanisms  and  found  a 
significant  role  for  social  relationships  on  mortality  rates.  In 
other  contexts,  researchers  have  found  that  social  relationships 
can  also  propagate  risky  behavior.  Selection  effects-whereby 
people  choose  to  become  friends  with  similar  individuals 
(see  [6])-and  influence  effects-whereby  people  are  influenced 
towards  risky  behavior  by  their  social  contacts  (see  [7]- 
[9])-could  provide  environments  in  which  social  relationships 
increase  factors  of  mortality  and  morbidity.  Researchers  have 
studied  smoking  behaviors  embedded  in  social  networks  in  a 
number  of  ways,  e.g.  through  analysis  of  social  positions  [10], 
traditional,  longitudinal  statistical  models  [11],  and  stochas¬ 
tic  actor  based  models  [9].  Specific  policy  implications  of 
tobacco-related  workplace  health  promotion  has  been  studied 
extensively  (e.g.  [12]),  and  have  important  implications  for 
health  outcomes. 

Due  to  an  array  of  difficulties  in  statistical  analysis  of 
social  network  data,  the  use  of  traditional  methods  may  lead 
to  erroneous  results.  Chief  among  these  difficulties  is  the 
independence  of  observations  assumption  maintained  for  many 
of  these  traditional  methods.  Using  recently  developed  tools 
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for  the  longitudinal  study  of  actor  behavior  embedded  in 
social  networks  (see  [13],  [14]),  researchers  have  made  strides 
towards  overcoming  many  of  these  difficulties. 

This  study  revisits  a  small  dataset  collected  by  Van  de  Bunt 
[2],  [15]  which  can  be  used  to  explore  the  effects  of  designated 
smoking  areas  on  relationship  formation  among  university 
students,  and  illustrates  why  the  use  of  tools  to  assess  and 
account  for  time  heterogeneity  developed  by  [16]  and  applied 
by  [1],  [17]  represent  a  crucial  part  of  the  modeling  process. 

With  methods  to  better  detect  and  account  for  time  het¬ 
erogeneity  in  parameterizations  of  coevolution  models,  re¬ 
searchers  can  be  more  confident  in  the  results  of  inference 
drawn  from  survey  data  used  to  study  selection  and  influence 
effects  on  smoking  behavior.  These  inferences  can  be  used  to 
guide  healthcare  policy  and  prevention  programs  that  target 
the  sorts  of  risk  factors  observed  in  the  data. 

II.  Data 

The  data  from  this  study  comes  from  [2],  [15],  who  col¬ 
lected  data  for  university  freshmen  over  seven  time  periods.1 
Of  a  body  of  49  students,  17  either  dropped  out  of  the  program 
or  did  not  respond  more  than  three  times,  yielding  a  dataset 
of  7  time  periods  for  32  students.  There  are  four  distinct 
programs  which  the  students  were  assigned  to:  a  two  year, 
a  three  year,  and  a  four  year  program.  The  sex  and  smoking 
behavior  (yes  or  no)  were  also  recorded  at  the  onset  of  the 
study.  The  sociometric  data  was  collected  at  each  observation, 
and  permitted  the  students  to  assign  one  of  six  categories  listed 
in  Table  I  to  their  relationship  with  each  of  the  other  students 
in  the  study.  In  the  setting  of  the  university,  the  smokers  would 
have  to  separate  themselves  from  the  rest  of  the  group  to  visit 
the  smoking  area.  [2]  remarks  that  they  did  so  often.  This 
smoking  area  was  designated  far  away  from  where  students 
and  faculty  would  gather  for  coffee  breaks  between  lectures. 
Students  of  the  three  programs  had  overlapping  curricula  for 

1  The  first  four  time  points  are  three  weeks  apart,  and  the  last  three  time 
points  are  six  weeks  apart 
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Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
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Label 

Best  friendship 
Friendship 

Friendly  relationship 

Neutral  relationship 

Unknown  person 
Troubled  relationship 


Description 

Persons  whom  you  would  call  your  real  friends. 

Persons  with  whom  you  have  a  good  relationship,  but  whom  you  do  not  (yet)  consider 
a  ’real’  friend. 

Persons  with  whom  you  regularly  have  pleasant  contact  during  classes.  The  contact 
could  grow  into  a  friendship. 

Persons  with  whom  you  have  not  much  in  common.  In  case  of  an  accidental  meeting 
the  contact  is  good.  The  chance  of  it  growing  into  a  friendship  is  not  large. 

Persons  whom  you  do  not  know. 

Persons  with  whom  you  can’t  get  on  very  well,  and  with  whom  you  definitely  do  not 
want  to  start  a  relationship.  There  is  a  certain  risk  of  getting  into  a  conflict. 


TABLE  I:  Labels  and  descriptions  for  the  response  categories  of  the  Van  de  Bunt  1999  student  survey  dataset. 


2468  10  2468  10 

(a)  In  Degree  Distribution  (b)  Out  Degree  Distribution 

Fig.  1:  Kernel  density  plots  for  average  degree  and  outdegree 
across  all  observations.  Best  friendship  is  given  by  the  black 
dotted  line,  friendship  by  the  red  dotted  line,  and  friendly 
friendship  by  the  green  dotted  line.  The  bold,  solid,  black  line 
corresponds  to  the  sum  of  all  three  friendly  relationships. 

the  first  few  months,  but  diverged  afterwards  (especially  for 
the  two  year  program). 

Figure  1  displays  kernel  density  plots  on  the  in  degree  and 
outdegree  distributions  of  the  survey  responses  for  the  friendly 
relationships  (i.e.  best  friendship,  friendship,  and  friendly 
friendship).  Experience  with  such  datasets  suggests  that  mean¬ 
ingful  relationship  definitions  (i.e.  friendship)  should  result  in 
degree  distributions  with  most  probability  mass  between  three 
and  eight;  the  density  plots  indicate  a  compliance  with  such 
a  criteria.  As  in  the  original  study,  we  dichotomize  friendly 
relationship,  friendship,  and  best  friendship  into  a  present  tie, 
or  a  1  on  the  digraph,  and  all  other  ties  to  a  0. 

Another  important  feature  of  network  data  is  the  amount 
of  tie  turnover-that  is,  the  amount  of  links  or  ties  which 
persist  from  observation  to  observation  versus  those  that 
change.  Table  II  shows  the  frequency  of  the  four  possible 
outcomes  for  link  transition.  The  Jaccard  similarity  coefficient 
J  is  also  reported,  which  is  a  good  indication  of  how  much 
network  turnover  occurs  from  period  to  period.2  The  number 
of  persistent  ties  generally  increases  from  period  to  period, 

2The  Jaccard  index  is  given  by 
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J 
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TABLE  II:  Network  turnover  frequency  corresponding  to  the 
period  indicated  by  column  and  the  tie  outcomes  indicated 
by  row.  The  Jaccard  index  J  is  reported  for  each  period  on 
the  bottom  row.  For  example,  the  cell  corresponding  to  row 
1  — >  0  and  column  1  — >  2  has  value  43,  meaning  that  43 
relationships  were  present  in  the  first  observation  but  deleted 
during  the  second. 

and  tie  creation  generally  increases  substantially  over  the  life 
of  the  study.  With  Jaccard  indices  close  to  .5,  we  should  have 
reasonable  power  to  estimate  statistical  parameters  (see  [13]). 

This  data  is  available  for  download  from  the  Siena  website 
at  http://stat.gamma.rug.nl/siena_datasets.htm. 

III.  Stochastic  Actor  Oriented  Models  (SAOM) 

A  social  network  composed  of  n  actors  is  modeled  as  a 
directed  graph  (digraph),  represented  by  an  adjacency  matrix 
(Xij)nxn,  where  xi7  =  1  if  actor  i  is  tied  to  actor  j,  Xij  =  0 
if  i  is  not  tied  to  j,  and  Xu  =  0  for  all  i  (self  ties  are  not 
permitted).  It  is  assumed  here  that  the  social  network  evolves 
in  continuous  time  over  an  interval  T  C  R1  according  to 
a  Markov  process.  Accordingly,  the  digraph  x(f)  models  the 
state  of  social  relationships  at  time  t  £  T.  Changes  to  the 
network  called  updates,  occur  at  discrete  time  points  defining 
the  set  £  C  T.  Elements  of  the  set  are  denoted  La  with 
consecutive  natural  number  indices  a  so  that  Li  <  L2  < 
...  <  L|£|,  where  the  notation  |.|  is  used  to  denote  the  number 
of  elements  in  a  set.  The  network  is  observed  at  discrete  time 

or  the  number  of  ties  which  persist  from  observation  to  observation  divided 
by  the  sum  of  persistent  ties  and  ties  which  are  created  and  destroyed.  The 
range  of  this  measure  is  from  0  to  1,  where  higher  numbers  indicate  less 
network  turnover. 


points  called  obsen’ations  defining  the  set  Mi  with  elements 
Ma  indexed  similarly  with  consecutive  natural  number  indices 
a  so  that  Mi  <  M2  <  ■■■  <  M\m\-  Define  a  set  of  periods 
with  elements  Wa  £  VV  each  representing  the  continuous  time 
interval  between  two  consecutive  observations  Ma  and  M0+i : 

Wa  =  [Ma,  Ma+i]  =  {t£ff  :  Ma  <t<  Ma+i}. 


A.  Rate  Function 

The  rate  function  describes  the  rate  at  which  an  actor  i 
updates  tie  variables.  Waiting  times  between  opportunities  for 
actor  i  to  make  an  update  to  the  digraph  are  exponentially  dis¬ 
tributed  with  rate  parameter  A,(x),  and  it  follows  that  waiting 
times  between  any  two  opportunities  for  updates  across  all 
actors  are  exponentially  distributed  with  rate  parameter 


By  definition,  |W|  =  \Mi\  —  1.  When  C  =  Mi,  we  have  full 
information  on  the  network  updates  over  the  interval  T.  We 
use  upper  case  to  denote  random  variables  (e.g.  X,  M,  L). 

There  is  a  variety  of  models  proposed  in  the  literature 
for  longitudinally  observed  social  networks.  In  this  paper, 
we  consider  the  approach  proposed  by  [13],  the  stochastic 
actor  oriented  model  (SAOM).  Here,  the  stochastic  process 
{X(i)  :  t  £  T}  with  digraphs  as  outcomes  is  modeled  as  a 
Markov  process  so  that  for  any  time  ta  £  T,  the  conditional 
distribution  for  the  future  {X(f)  :  t,  >  ta}  given  the  past 
{X(t)  :  t  <  to]  depends  only  on  X(fa). 

From  the  general  theory  of  continuous-time  Markov  chains 
[18]  follows  the  existence  of  the  intensity  matrix  that  describes 
the  rate  at  which  X(f)  =  x  tends  to  transition  into  X.(t+dt )  = 
x  as  dt  — >  0: 


q(x,x)  =  lim 

dtyo 


P{X(t  +  dt)  =  x  |  X(t)  =  x} 
dt 


(x/x), 

(1) 


where  x  £  X.  The  SAOM  supposes  that  a  digraph  update 
consists  of  exactly  one  tie  variable  change.  Such  a  change  is 
referred  to  as  a  ministep.  This  property  can  be  expressed  as 

g(x,  x)  >  0  =>  y  abs(xij  -  Xjj)  =  1  (2) 

where  abs(.)  denotes  the  absolute  value.  Therefore  we  can  use 
the  notation 


5ij(x)  =  q(x,  x)  where  xtj  xtj  (3) 

SAOMs  consider  two  principal  concepts  in  constructing  the 
intensity  matrix:  how  often  actors  update  their  tie  variables  and 
what  motivates  their  choice  of  which  tie  variable  to  update. 
This  is  expressed  by  the  formulation 


A+(x)  =  ^  A,:(x).  (5) 

i 

It  is  possible  to  specify  any  number  of  functional  forms  for 
Aj(x),  to  include  combinations  of  actor-level  covariates  and 
structural  properties  of  the  current  state  of  the  network  x; 
however,  in  many  applications,  rate  functions  are  modeled  as 
constant  terms. 


B.  Evaluation  Function 

Once  an  actor  i  is  selected  for  an  update,  the  actor  must 
select  a  tie  variable  xl:l  to  change.  Define  x(i  j)  £  X  as  the 
digraph  resulting  from  actor  i  modifying  his  tie  variable  with 
j  during  a  given  time  period  t  so  that  Xij(i  j)  =  1  —  Xij, 
and  formally  define  x(i  *)  =  x. 

The  SAOM  assumes  that  the  probabilities  ptj  (x)  depend  on 
the  evaluation  function  that  gives  an  evaluation  of  the  attrac¬ 
tion  toward  each  possible  next  state  of  the  network,  denoted 
here  by  f)j  (x).  This  attraction  is  conveniently  modeled  as  a 
linear  combination  of  the  relevant  features  of  each  potential 
change  i  j: 

j fij{ x)  =  /3TSi{x(i  j))  (6) 


where  Si  is  a  vector-valued  function  containing  structural 
features  of  the  digraph  as  seen  from  the  point  of  view  of 
actor  i,  and  B  is  a  statistical  parameter.  [13],  following  the 
the  econometric  literature  on  discrete  choice  (see,  e.g.,  [19], 
[20]),  models  the  choice  of  i  j  as  a  myopic,  stochastic  op¬ 
timization  of  a  conditional  logit.  This  amounts  to  choosing  the 
greatest  value  (x)  +  where  e,tj  is  a  Gumbel  distributed 
error  term.  This  leads  to  the  conditional  choice  probabilities 
Pij  (x)  that  actor  i  chooses  to  change  tie  variable  i  j  given 
the  current  digraph  x: 


Pij(x) 


exp  fjj(x) 

ELi  exp  fik  (x) 


(7) 


qij(x)  =  Ai(x)Pij(x).  (4) 

The  interpretation  is  that  actor  i  gets  opportunities  to  make 
an  update  in  her/his  outgoing  tie  at  a  rate  of  Aj(x)  (which 
might,  but  does  not  need  to,  depend  on  the  current  network); 
if  such  an  opportunity  occurs,  the  probability  that  i  selects  x^ 
as  the  tie  variable  to  change  is  given  by  Pij(x).  The  actors  are 
not  required  to  make  a  change  when  an  opportunity  occurs, 
which  is  reflected  by  the  requirement  JA  pl3(x)  <  1,  without 
the  need  for  this  to  be  equal  to  1 .  The  probabilities  p,j  (x)  are 
dependent  on  the  so-called  evaluation  function,  as  described 
below. 


In  accordance  with  the  formal  definition  x(i  ^  i)  =  x,  the 
choice  j  =  i  is  interpreted  as  keeping  the  current  digraph  as 
it  is,  without  making  a  change. 

For  a  thorough  menu  of  what  kinds  of  statistics  s,;  are 
appropriate  for  actor  oriented  models,  see  [14],  [21],  We  will 
present  here  those  fundamental  structural  statistics  which  are 
used  in  this  study:  outdegree  (density),  reciprocity,  and  transi¬ 
tive  triplet,  three  cycle,  betweenness,  and  in-degree  popularity 
effects.  Figure  2  illustrates  the  effects  graphically,  and  gives 
mathematical  definitions. 

1)  Outdegree  (density)  effect,  defined  by  the  number  of 
outgoing  ties  for  some  given  ego  i  (see  Figure  2a). 


h 


i 


S*l(x)  =  Xij  =  1 

(a)  Outdegree 


Si 2(x)  =  XijXji  =  1 

(b)  Reciprocity 


^z3(x)  —  XijXjh% hj  1 

(c)  Transitive  Triplet 


^i4(x)  —  XijXjhXhi  —  1 

(d)  Three  cycle 


Si5  (x)  y  '.j  v,  X hi^ij  (  ^-  xhj  )  —  2 

(e)  Betweenness 


Sie(x)  ~  12  j  xij  1 2hxjh  —  4 

(f)  In-degree  popularity 


Fig.  2:  Graphical  representation  of  structural  effects 


2)  Reciprocity  effect,  defined  by  the  number  of  outgoing 
ties  that  are  matched  (or  reciprocated)  by  a  correspond¬ 
ing  incoming  tie  (see  Figure  2b). 

3)  Transitive  Triplet,  defined  by  the  number  of  patterns 
matching  Figure  2c.  With  a  positive  coefficient,  this 
effect  represents  a  tendency  towards  network  closure. 

4)  Three  cycles  [21]  states  that  this  effect  may  be  regarded 
as  a  generalized  reciprocity.  In  conjunction  with  positive 
estimates  for  network  closure  (e.g.  transitive  triplet),  a 
negative  three  cycles  coefficient  indicates  a  tendency 
towards  local  heirarchy  (see  Figure  2d). 

5)  Betweenness  represents  the  tendency  of  actors  to  want 
to  position  themselves  between  actors  who  are  not  tied 
to  each  other  (see  Figure  2e). 

6)  In-degree  Popularity  represents  the  tendency  of  actors 
to  want  to  form  relationships  with  high  in-degree  (i.e. 
popular)  alters  (see  Figure  2f). 

These  effects  features  are  regarded  as  representing  funda¬ 
mental  aspects  of  social  network  dynamics,  but  more  features 
may  also  be  formulated.  One  possibility  is  to  include  exoge¬ 
nous  actor-level  covariates  c  £  R™.  Throughout  this  study, 
we  will  make  use  of  sex,  smoking  behavior,  and  program 
membership  to  help  explain  variation  in  tie  choices.  Figure 
III  illustrates  graphically  how  the  four  possible  combinations 
of  ego/alter  ties  are  modeled.  For  the  multi-valued  covariate 
program  membership,  we  use  only  the  same  covariate  effect 
(i.e.  only  situations  in  Tables  IIIc  and  Hid  are  differentiated). 


See  also  [7],  [13],  [14],  [22]  for  thorough  developments  of 
possible  statistics  and  extensions  to  models  which  consider  the 
coevolution  of  network  and  endogenous  behavioral  character¬ 
istics. 

C.  Time  Heterogeneous  Parameters 

Using  the  score  type  test  of  [16],  [1]  proposed  a  test  for 
time  heterogeneous  parameters  in  SAOMs.  Their  formulation 
considers  that  the  parameters  /?  in  (6)  are  permitted  to  vary 
over  time.  Formally,  consider  a  SAOM  formulated  as  in  (6) 
with  some  set  of  effects  1C  =  {Kk  ■  k  £  Ni}  included.  We 
initially  assume  that  /3  does  not  vary  over  time,  yielding  a 
restricted  model.  Our  data  contains  |A4|  <C  |£|  observations, 
so  we  estimate  the  restricted  model  by  the  method  of  moments 
mentioned  in  Section  III-D.  For  reasons  introduced  in  Section 
??,  we  wish  to  test  whether  the  restricted  model  is  misspecified 
with  respect  to  time  heterogeneity.  An  unrestricted  model 
which  allows  for  time  heterogeneity  in  all  of  the  effects  is 
considered  as  a  modification  of  (6): 

4r}(x)  =  (Z3* +4a))stfc(x(*  j))  (8) 

Kkeic 

where  6 is  called  the  time  dummy  interacted  effect  pa¬ 
rameter  for  effect  k  and  period  a.  Define  also  the  vectors 
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A/-,j{x}  corresponds  to  the  change  statistic  for  the  proposed  tie  indicated  in  the  column  x. 

TABLE  III:  Graphical  representation  of  ego-alter  covariate  (selection)  effects.  The  table  illustrates  how  all  of  the  possible 
selections  of  smoker/non-smoker  alters  by  smoker/non-smoker  egos  are  explicitly  enumerated  with  the  parameters  of  the 
model.  A  cell  with  value  1  indicates  that  the  statistic  is  increased  by  the  indicated  tie.  The  inferences  drawn  in  the  results 
section  will  add  the  parameters  together  with  the  formula  indicated  in  the  right  hand  column. 


Equation  (8)  applies  for  updates  occurring  during  the  period 
Wa.  By  convention,  6^  =  0  for  all  k  such  that  Kk  G  1C 
so  that  the  first  period  is  called  the  base  period ;  therefore, 
the  vector  of  time  dummy  interacted  effect  parameters  S  has 
length  ( | W |  —  1)|/C|.3  The  test  of  [1]  is  the  following  omnibus 
test: 


H0  :  5  =  0 

Hi  :  (5  ^  0.  (9) 
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(10) 


This  approach  departs  from  [2]  in  at  least  one  important 
way.  [2]  estimates  a  model  for  each  intervening  period  between 
observations  separately.  This  is  an  equivalent  approach  to  pro¬ 
viding  time  dummies  (??)  for  all  effects  and  all  periods.  This 
approach  estimates  a  model  jointly  across  all  observations,  but 
only  includes  time  dummies  for  those  effects  which  contain 
evidence  for  time  heterogeneity. 


D.  Estimation 

A  key  feature  of  Equation  (7)  is  the  convenient  form  of 
its  log  odds  ratios  between  any  two  potential  next  networks 


"3  { 1) 

Because  d,  '  is  fixed,  it  is  implicitly  omitted  from  5k  and  <5  throughout 
the  notation. 


This  is  the  characteristically  simple  property  that  makes  esti¬ 
mation  in  classical  discrete  choices  quite  straightforward  (see, 
e.g.,  [23]),  and  is  also  the  basis  for  the  SAOM.  If  the  network 
updates  C  are  all  observed  so  that  we  have  full  information  (i.e. 
C  =  Ad),  and  in  the  usual  situation  that  the  rate  parameters 
are  independent  of  the  parameters  of  the  objective  function, 
a  convenient  partial  likelihood  is  available  for  the  statistical 
parameters  of  the  objective  function.  We  use  the  notation 
x(a)  =  x(m0)  =  x(Za)  for  the  a-th  network  observed  in  the 
dataset. 

Define  a  vector  of  binary  variables  d  such  that 

1  if  x(a+1)  =  x^aPi  j) 

(11) 

0  if  x(a+1)  ^  j) 

which  denotes  whether  actor  i  selected  variable  x'"-!  to  up¬ 
date.  Note  that  Ej=i^y  (a)  =  1  f°r  a ■  The  partial  log 


likelihood  for  the  objective  function  parameters  is 
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Under  regularity  conditions,  a  solution  /3j^L  to 

Vl(/3)  =  0 


(12) 


where 

X7-f—  — 

~{d^1’d/32,'") 

is  a  maximum  likelihood  estimate  for  /3. 

Unfortunately,  complete  information  on  each  network  up¬ 
date  is  extremely  rare,  and  a  likelihood  function  as  in  (12)  is 
not  readily  available.  Even  when  tie  creation  is  observed  in 
continuous  time,  it  is  not  often  that  observations  concerning 
termination  of  ties  are  also  available.  It  is  far  more  often 
the  case  that  the  data  observed  will  be  in  the  form  of  panel 
data,  where  typically  \A4\  <C  \C\.  Accordingly,  the  method 
of  moments  as  proposed  by  [13]  can  be  used  as  an  alterna¬ 
tive  method  for  obtaining  reasonable  estimates.  Consider  the 
estimating  function 

9n(0',zn)  = 

(E{u(X(o))  |  =x<°-1>}  -u(x(al)), 

ma£.M 

(13) 

which  is  simply  the  sum  of  deviations  between  the  expected 
value  of  the  statistics  for  the  simulated  networks  and  the 
observed  networks;  zn  simply  means  all  of  the  available  data, 
and  6  is  a  vector  of  parameters  for  the  objective  and  rate 
functions  described  earlier.  ■u(x)  is  a  function  that  corresponds 
to  appropriately  chosen  statistics  calculated  from  the  digraph 
for  the  parameters  6  (based  on  the  statistics  in  Section  III-B). 
The  method  of  moments  involves  finding  the  moment  estimate 
9  solving  the  moment  equation 

gn(0;zn)  =  0  (14) 

The  specific  details  of  fitting  moment  estimates  are  rather 
involved,  and  entail  simulating  networks  X':V  many  times  to 
achieve  a  reliable  result  for  the  expectation  in  (14).4  This 
simulation  is  very  straightforward.  Take  an  initial  network 
x(7i)  and  proceed  as  follows  for  each  update  l„  £  C: 

1)  Set  la  =  Za_ i  +  Expon(A+) 

2)  Select  actor  i  with  probability  '  ^  a  1  \ '  . 

EUa-l) 

3)  Select  actor  j  with  probability  pij(la_ i). 

4)  If  i  ^  j,  Set  1  Xij{la—  i). 

5)  Repeat  until  some  specified  conditions  (e.g.  number  of 
updates  |£|  or  some  holding  time  (Z|£|  —  la)  is  exceeded) 
are  satisfied. 

4That  this  can  take  a  considerable  amount  of  time  per  estimation  motivates 
the  use  of  the  score-type  test  of  [1] 


See  [13]  for  guidelines  on  the  selection  of  appropriate 
statistics  for  it(X(i))  and  information  on  how  to  estimate  the 
root  of  gn(zn,  6 ),  and  [24]  for  the  estimation  of  the  derivative 
matrix,  covariance  matrix,  and  standard  errors. 

E.  Modeling  approach 

We  aim  to  analyze  the  data  of  [2],  [15]  with  a  time 
heterogeneous  model  specification,  as  in  Equation  (8).  Using 
the  forward  model-selection  technique  of  [1]  and  elaborated 
in  [17],  we  will  select  a  model  with  time  dummy  interacted 
parameters  until  the  test  in  (9)  fails  to  reject  Hq  (this  model 
will  be  called  Model  B.3,  see  below).  We  will  then  consider 
subsets  of  these  parameters  as  alternate  model  specifications 
and  estimate  them  using  method  of  moments: 

•  Model  A.l:  Structural  effects  only.  This  model  will  be 
used  primarily  to  assess  the  stability  of  the  structural 
effects  as  we  add  more  parameters. 

•  Model  A.2:  Structural  effects  and  covariate  effects.  Pa¬ 
rameter  estimates  for  this  model  will  yield  a  basis  for 
comparison,  so  that  we  may  compare  the  interpretation 
that  would  result  from  ignoring  potential  time  heterogene¬ 
ity. 

•  Model  B.l:  Structural  effects  only,  but  using  the  struc¬ 
tural  effect  time  heterogeneity  parameters  included  in 
Model  B.3.  As  before,  this  model  will  be  used  primarily 
to  assess  the  stability  of  the  structural  effects  as  we  add 
more  parameters. 

•  Model  B.2:  Structural  effects  and  covariate  effects,  but 
using  the  structural  effect  time  heterogeneity  parameters 
included  in  Model  B.3.  No  covariate  time  heterogeneity 
parameters  are  used.  This  will  help  us  to  assess  whether 
any  time  heterogeneity  ultimately  found  in  them  under 
Model  B.3  can  be  explained  by  time  heterogeneity  in  the 
structural  effects. 

•  Model  B.3:  This  is  the  model  which,  after  using  the 
forward  model  selection  of  [1],  fails  to  reject  //<,  for 
(9),  indicating  that  much  of  the  time  heterogeneity  in  the 
chosen  effects  has  been  accounted  for. 

All  of  the  model  terms  will  include  rate/covariate  interaction 
terms  presented  in  ??  for  smoking  behavior,  sex,  and  program 
membership.  After  estimating  these  models,  we  use  Equation 
(7)  to  calculate  conditional  probabilities  for  smoker  and  non- 
smoker  alter  selection  so  that  we  may  more  easily  interpret 
Model  A  and  Model  B. 


IV.  Results 

The  results  of  the  five  models  presented  in  Section  III-E  are 
presented  in  Table  IV.  Note  that  the  list  of  effects  included  in 
the  first  column  is  not  exhaustive  for  time  heterogeneity  terms; 
they  represent  the  final  model  arrived  at  after  using  the  model 
selection  procedure  of  [1],  [17],  which  incrementally  adds 
time  dummy  interected  parameters  until  the  joint  test  fails  to 
reject  H0  given  in  (9).  Perhaps  surprisingly,  the  program/rate 
interaction  terms  are  near-zero  with  large  standard  errors 
across  all  models  and  are  therefore  not  presented. 


Across  all  estimations,  convergence  is  quite  good,  as  ev¬ 
idenced  by  very  low  t*  test  statistics  (see  [14]  for  more 
information). 

A.  Model  A:  Time  homogeneous  parameters 

Estimates  for  structural  effects  are  stable  across  Model  A.l 
and  Model  A. 2  (with  the  exception  of  outdegree),  which  is 
encouraging.  We  would  expect  outdegree  to  be  systematically 
different  on  the  basis  of  Table  III,  since  inclusion  of  the 
other  covariate  effects  turns  the  outdegree  effect  into  a  sort 
of  base  case.  Interestingly,  the  relatively  large  standard  error 
and  small  magnitude  of  the  outdegree  estimate  indicates  that 
we  have  little  evidence  to  reject  Hq  :  jd i  =  0.  As  the 
network  is  rather  small,  and  there  are  three  effects  with  large 
(negative)  and  significant  estimates,  this  result  should  not  be 
of  much  concern.  The  structural  effects  results  indicate  reci¬ 
procity  and  a  tendency  towards  closure  as  we  expect  in  many 
social  settings.  A  negative  three  cycles  effect  indicates  some 
tendencies  towards  local  heirarchy.  The  small  and  negative 
betweenness  coefficient  could  proxy  for  tendencies  towards 
network  closure,  or  it  could  be  an  artifact  of  a  small  network. 

We  see,  as  might  be  expected,  that  membership  in  the  same 
program  causes  a  slight  increase  in  probability  of  friendship 
formation.5 

Using  the  result  of  Table  III,  it  is  possible  to  show  that 
the  probability  of  a  male  selecting  a  male  is  .24  greater  than 
selecting  a  female,  ceteris  paribus.  Women  select  women 
versus  men  with  probility  .12  less  than  selecting  a  male, 
indicating  a  universally  greater  tie-formation  attractiveness  for 
males.  These  results  can  be  calculated  directly  from  (7)  when 
considering  an  ego  i,  conditioned  on  the  sex  of  i  and  on  forcing 
a  tie  formation  for  i,  with  two  opportunities  *  j  and  i  k 
where  j  and  k  have  opposite  sexes. 

On  the  basis  of  Model  A’s  estimation  results,  we  might 
conclude  that  smoking  has  no  significant  effect  on  friend¬ 
ship  formation,  due  to  the  large  standard  errors  associated 
with  the  parameter  estimates  for  the  smoking  covariate  ef¬ 
fects.  Interpretation  of  the  parameters  in  a  similar  manner 
as  performed  for  sex  yields  the  result  that  smokers  choose 
smoking  alters  with  probability  .55,  and  non-smokers  choose 
smokers  with  probability  .44.  This  indicates  a  slight  tendency 
towards  universally  more  tie-formation  related  attractiveness 
of  smoking  alters,  but  again,  due  to  the  size  of  the  standard 
errors,  parameter  interpretation  of  Model  A  would  likely  fail  to 
uncover  a  meaningful  relationship  between  smoking  behavior 
and  social  relationship  formation. 

We  note  that  the  time  heterogeneity  test  of  [1]  for  Models 

A. l  and  A. 2  rejects  the  hypothesis  of  time  homogeneous 
parameters,  which  helps  to  motivate  the  exploration  of  Model 

B. 

5Using  Equation  (7),  we  expect  roughly  a  .13  increase  in  probability  for 
Xij  if  i,  j  are  in  the  same  program.  This  result  comes  from  considering  a 
notional  case  of  a  some  selected  alter  considering  the  ties  i  j  and  i  k 
exclusively.  Using  Equation  (7)  and  the  estimates  of  Table  IV.  it  is  easy  to 
show  that  if  i,j  are  in  the  same  program  and  i,  k  are  not,  p/:  j  =  .627. 


B.  Model  B:  Time  heterogeneous  parameters 

We  have  a  similar  result  in  insignificant  outdegree  estimates 
but  again  due  to  the  small  size  of  the  network  an  presence 
of  large  (negative),  significant  estimates  for  other  effects, 
there  is  little  reason  for  concern.  The  structural  effects  largely 
agree  with  the  results  of  Model  A  in  both  magnitude  and 
in  direction,  so  we  still  detect  tendencies  towards  network 
closure,  reciprocity,  and  local  heirarchy.  Where  the  models 
begin  to  differ  is  in  the  standard  errors  of  the  covariate  effects, 
and  in  the  magnitudes  of  the  smoking  effects.  Where  Model 
A  gives  us  little  reason  to  think  that  smoking  is  an  important 
factor  for  explaining  network  formation,  the  largest  estimates 
(in  magnitude)  for  Model  B  are  actually  smoker  selection 
effects.  In  order  to  fix  ideas  on  how  these  two  models  differ, 
we  construct  a  so-called  ego-alter  selection  table,  where  we 
suppose  again  that  some  ego  i  has  a  fixed  smoking  behavior, 
and  that  i  is  forced  to  form  a  tie  with  one  of  two  alters,  j  or 
k.  These  two  alters  have  opposite  smoking  behaviors,  and  we 
consider  the  probability  that  i  selects  either  on  the  basis  of  the 
parameter  estimates. 

C.  Ego/alter  selection  results  for  Model  A  and  Model  B 

Table  V  presents  the  ego/alter  selection  table.  Using  the 

coefficients  indicated  by  Table  III  and  the  estimates  from 
Table  IV,  we  estimate  for  each  of  the  four  possible 

combinations  of  smoker/non-smoker  ego/alter.  The  estimates 
reported  are  calculated  from  column  two  for  each  of  the 
models,  following  Table  III.  The  probabilities  reported  in 
parenthesis  refer  to  the  proportion  of  times  that  some  ego  i 
with  a  given  smoking  behavior  Vi  would  form  a  tie  with  alter 
j  with  a  given  smoking  behavior  Vj  instead  of  another  alter 
k  with  smoking  behavior  1  -  v;i  in  a  notional,  empty  network 
containing  actors  i,j,k  only.  These  probabilities  are  available 
directly  from  Equation  (7). 

Models  A.l  and  B.l  are  not  of  much  interest  for  smoker 
selection  interpretation,  as  they  do  not  include  any  terms  for 
smoking  covariates  (accordingly,  the  probabilites  are  equal  for 
smokers  and  non-smokers).  Model  A. 2  suggests  that  actors 
have  a  rather  small  preference  on  the  basis  of  smoking 
behaviors,  a  result  discussed  in  the  last  section.  The  selection 
probabilities  differ  by  a  rather  small  amount  (roughly  .05 
in  either  case),  and  both  smokers  and  non-smokers  appear 
to  have  a  slight  selection  preference  for  smoking  alters.  In 
other  words,  the  data  supports  the  notion  that  smokers  are 
(slightly)  universally  more  popular,  rather  than  that  smokers 
select  smokers  and  non-smokers  select  non-smokers.  We  note, 
however,  that  these  results  would  have  to  be  presented  with 
caution,  due  to  the  size  of  the  corresponding  standard  error 
(which  is  roughly  twice  the  size  of  /?i3,  e.g.). 

Turning  to  Model  B,  it  is  apparent  that  the  inclusion 
of  time  heterogeneous  parameters  lends  a  more  substantive 
interpretation.  An  immediate  difference  between  Model  A 
and  Model  B  is  that  the  standard  errors  are  smaller  in  the 
estimates  of  smoking  covariate  effects  when  time  heteroge¬ 
neous  parameters  are  specified,  so  we  can  draw  inference 
with  less  uncertainty.  Further,  the  difference  in  magnitudes 


1  2  3  4  5 


Fig.  3:  Selection  probabilities  given  by  estimates  for  Model 
B.3  shown  in  Table  V.  The  dotted  line  represents  the  prob¬ 
ability  that  an  ego  who  smokes  selects  another  smoking  ego 
for  tie  formation  as  opposed  to  a  non-smoker.  The  solid  line 
represents  the  probability  that  a  non-smoker  selects  another 
non-smoker  for  tie  formation  as  opposed  to  a  smoker.  A 
horizontal  reference  line  is  given  at  p  =  .5  to  show  estimates 
which  indicate  homophilous  selection  behaviors. 

of  the  smoking  covariate  effect  parameter  estimates  between 
the  two  models  is  substantial.  Model  B.2,  which  controls  for 
heterogeneity  in  structural  effects  but  not  selection  effects, 
indicates  that  smokers  are  universally  more  popular  as  alter 
choices  regardless  of  the  ego  smoking  behavior,  and  that  there 
is  an  additional  homophily  effect  of  roughly  .07  for  smokers 
to  select  other  smokers.  After  permitting  smoking  selection 
effects  to  vary  over  time  in  Model  B.3,  we  a  more  pronounced 
version  of  the  same  result  as  in  Model  B.2  for  periods  1  and 
2,  and  the  effect  diminishes  for  periods  3,  4,  and  5.  After 
Period  2,  smokers  are  no  longer  universally  more  attractive, 
and  we  see  a  homophily  effect  form  on  the  basis  of  smok¬ 
ing  behaivor.  Figure  3  illustrates  the  selection  probabilities 
for  smoker/smoker,  nonsmoker/nonsmoker  selection  given  by 
the  results  of  Model  B.3.  [1]  finds  that  omission  of  time 

heterogeneity  causes  estimates  which  in  some  sense  average 
over  the  heterogeneity,  and  it  seems  that  perhaps  we  have  a 
similar  result  here,  where  the  effect  is  in  some  sense  covered 
up  in  time  homogeneous  models. 

V.  Discussion 

With  a  time  homoegenous  model  specification,  results  indi¬ 
cated  no  sound  support  for  the  notion  that  smoking  behavior 
had  a  substantive  impact  on  the  formation  of  social  relation¬ 
ships.  Using  a  model  with  time  heterogeneity  terms,  we  were 
able  to  uncover  smoking  homophily  effects  after  period  1 
had  passed.  The  findings  of  this  study  could  have  important 
implications  in  two  respects:  First,  the  use  of  time  heteroge¬ 
neous  specifications  can  be  potentially  very  important  towards 
interpretation  of  parameters  of  interest  in  an  array  of  applied 
studies.  As  alluded  to  in  [1],  time  heterogeneity  in  parameters 
of  interest  can  be  intrinsically  interesting.  This  study  illustrates 


a  case  where  it  is  not  only  interesting,  but  failing  to  account 
for  behavioral  changes  over  time  can  lead  to  misspecified 
models  which  cover  up  important  social  dynamics.  Second, 
while  the  size  and  scope  of  this  small  study  is  rather  limited, 
and  the  results  may  not  be  generally  applicable,  more  research 
is  indicated  in  the  area  of  policy  regarding  the  establishment 
of  smoking  areas.  The  researchers  note  that  these  smoking 
areas  separated  smokers  and  non-smokers  during  coffee  breaks 
between  lectures. 

The  original  study  of  [2]  was  unable  to  uncover  a  coherent 
picture  of  how  smoking  affected  friendship  formation.  In 
contrast,  the  forward  model  selection  and  time  heterogeneity 
testing  employed  by  this  paper  was  able  to  discover  a  coherent 
picture  of  smoking  effects  which  decline  over  time,  but  are 
important  in  the  initial  vetting  of  potential  friendships. 

We  cannot  ascertain  influence  effects  from  this  particular 
dataset,  but  future  research  might  study  how  smoking  areas 
reinforce  smoking  behaviors.6  It  is  plausible  that  smoking 
areas  increase  smoking  homophily  selection  effects-a  notion 
supported  by  the  results  of  this  study-and  also  plausible  that 
an  ego’s  friendly  contacts’  smoking  behavior  influences  the 
ego’s  smoking  behavior. 

As  stated  before,  the  results  of  this  study  may  not  be 
generally  applicable  for  a  number  of  reasons.  The  dataset 
is  small,  and  a  number  of  students  failed  to  respond,  which 
may  have  caused  some  serious  biases  in  the  results.  As  noted 
by  [2],  the  operationalization  of  friendship  was  not  optimal, 
and  perhaps  a  social  activity  scale  would  have  been  better. 
A  number  of  covariates  have  been  left  out  which  may  have 
had  some  important  explanatory  power,  e.g.  performance  at 
the  school  or  extracurricular  activities. 

What  we  can  say  with  some  certainty  is  that  time  hetero¬ 
geneity  is  a  potentially  important  feature  of  actor  behavior, 
and  that  failing  to  account  for  it  in  model  selection  can 
cause  results  which  vary  widely  from  a  time  heterogeneous 
specification.  While  positive  statements  concerning  the  context 
in  which  the  study  was  conducted  in,  i.e.  regarding  policies 
on  smoking  areas,  are  tenuous,  the  results  do  motivate  further 
research  into  the  role  of  policies  regarding  the  establishment 
of  smoking  areas  on  a  broader  scale. 

VI.  Future  Work 

This  paper  represents  a  contribution  to  the  methodology  of 
studying  risky  behavior  and  social  networks.  The  dataset  used 
was  selected  because  of  its  position  as  one  of  the  first  datasets 
in  the  literature  to  be  analyzed  with  a  stochastic  actor  oriented 
model.  The  results  of  the  study  indicate  that  time  heterogeneity 
can  play  an  important  role  in  what  inference  is  drawn  from  a 
study  at  hand. 

Nonetheless,  the  context  for  this  methods  study  could  be 
enlarged  into  a  wide-scale  study  of  smoking  behavior  and 
social  network  evolution.  [26]  has  conducted  a  trial  study 
which  could  be  enlarged  in  such  a  manner.  Surveys  are  given 

6The  smoking  behavior  was  collected  at  only  one  time  period.  In  other 
studies,  e.g.  [25],  such  information  is  collected  longitudinally. 


to  school  children  periodically  which  ask  for  relationships  of 
friendship  and  for  personal  assessments  of  risky  behaviors. 
Many  classrooms  and  schools  are  observed  over  the  period 
of  perhaps  two  or  three  years.  A  multilevel  approach  is  then 
used  to  isolate  classroom  effects  from  the  main  effects,  and 
inference  is  drawn  on  the  latter.  Using  the  evidence  from  this 
paper,  attention  could  be  paid  to  time  heterogeneity  across 
these  two  years  when  drawing  inference. 

Determining  the  risk  factors  for  adolescent  smoking  is 
among  the  highest  priority  for  researchers  because  of  the  well 
established  adverse  health  effects  from  smoking.  By  collect¬ 
ing  data  over  time  and  using  the  stochastic  actor  oriented 
framework,  previously  elusive  peer  effects  can  be  disentangled 
from  selection  effects.  If  these  risk  factors  can  be  identified, 
intervention  programs  can  be  made  more  effective. 
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Effect 

Coefficient 

Model  A.l 

Model  A. 2 

Model  B.l 

Model  B.2 

Model  B.3 

Outdegree 

01 

-.18  (.25) 

-.30  (.37) 

-.24  (.29) 

-.10  (.41) 

-.01  (.44) 

Reciprocity 

0 2 

1.48  (.15) 

1.38  (.14) 

1.53  (.16) 

1.36  (.16) 

1.38  (.16) 

->  5 

«5(s) 

°2 

-.06  (.27) 

-.05  (.27) 

-.40  (.30) 

Trans.  Triplets 

03 

.42  (.03) 

.42  (.03) 

.54  (.04) 

.55  (.04) 

.56  (.04) 

->  3 

P') 

°3 

-.11  (.06) 

-.12  (.06) 

-.22  (.09) 

->  4 

p) 

°3 

-.19  (.07) 

-.19  (.07) 

-.25  (.09) 

-»  5 

p) 

°3 

-.31  (.06) 

-.31  (.06) 

-.38  (.08) 

Three  Cycles 

04 

-.45  (.06) 

-.43  (.06) 

-.55  (.07) 

-.52  (.07) 

-.53  (.07) 

2 

p) 

°4 

-.47  (.15) 

-.45  (.14) 

-.53  (.17) 

4 

p) 

°4 

.38  (.11) 

.39  (.11) 

.37  (.11) 

Betweenness 

05 

-.24  (.05) 

-.20  (.04) 

-.29  (.06) 

-.26  (.05) 

-.27  (.05) 

In-deg.  Popul. 

06 

-.14  (.03) 

-.13  (.02) 

-.15  (.03) 

-.14  (.03) 

-.14  (.03) 

Sex  Ego 

07 

-.12  (.15) 

-.08  (.18) 

-.08  (.18) 

Same  Sex 

08 

-.47  (.22) 

-.40  (.24) 

-.40  (.24) 

Sex  i  x  j 

09 

1.49  (.46) 

1.50  (.53) 

1.46  (.53) 

Same  Program 

010 

.52  (.08) 

.59  (.10) 

.59  (.10) 

Smoke  Ego 

0ii 

-.02  (.14) 

.08  (.16) 

.10  (.17) 

->  2 

p) 

°n 

-.64  (.34) 

->  5 

x(5) 

°n 

-1.20  (.33) 

Same  Smoke 

012 

-.24  (.51) 

-.55  (.55) 

-.52  (.57) 

2 

p) 

°12 

.50  (.43) 

-A  3 

p) 

°12 

.93  (.47) 

-»  4 

p) 

°12 

.66  (.57) 

->  5 

p) 

°12 

1.03  (.54) 

Smoke  i  x  j 

013 

.46  (1.03) 

1.46  (1.12) 

1.94  (1.16) 

2 

p) 

°13 

-1.11  (1.15) 

-»  3 

j(3) 

°13 

-1.76  (1.20) 

4 

p) 

°13 

-1.98  (1.39) 

-A  5 

-1.96  (1.30) 

Rate  1 

P’ 

3.51  (.82) 

3.42  (.75) 

3.30  (.52) 

3.31  (.48) 

3.29  (.51) 

Rate  2 

a<2) 

4.91  (.83) 

4.82  (.72) 

4.62  (.63) 

4.58  (.61) 

4.61  (.65) 

Rate  3 

Ai(3) 

7.87  (1.23) 

7.59  (1.14) 

7.76  (1.21) 

7.68  (1.00) 

7.92  (1.05) 

Rate  4 

M4) 

6.03  (.64) 

6.28  (.57) 

6.18  (.61) 

6.24  (.65) 

6.04  (.66) 

Rate  5 

MB) 

7.08  (.93) 

7.14  (.88) 

7.09  (.78) 

7.06  (.82) 

7.13  (.81) 

Smoke  x  Rate 

A 2 

-.14  (.34) 

-.20  (.38) 

.07  (.38) 

.08  (.15) 

.08  (.16) 

->  2 

°2 

.09  (.31) 

.07  (.30) 

.05  (.32) 

->  3 

p) 

°2 

-.62  (.27) 

-.58  (.31) 

-.53  (.32) 

Sex  x  Rate 

A3 

-.30  (.32) 

-.24  (.25) 

-.28  (.15) 

-.29  (.14) 

-.26  (.16) 

H0  :  4+)  =  0 

p 

0 

0 

.01 

.04 

.19 

Convergence 

max  \t*\ 

.11 

.07 

.06 

.06 

.06 

TABLE  IV:  Results  of  estimation  for  five  models.  Standard  errors  reported  in  parenthesis. 


X 

A/y{x  } 

A.l 

A. 2 

B.l 

B.2 

B.3r 

B.32 

B.33 

B.34 

B.3S 

<o— 

— #7 

k 

-.18 

(.5) 

-.30 

(.56) 

-.24 

(.5) 

-.10 

(.63) 

-.17 

(.63) 

-.17 

(.50) 

-.17 

(.40) 

-.17 

(.47) 

-.17 

(.38) 

iO- 

— Oi 

k + ki 

-.18 

(.5) 

-.54 

(.44) 

-.24 

(.5) 

-.65 

(■37) 

-.69 

(•37) 

-.19 

(.50) 

.24 

(.60) 

-.03 

(.53) 

.34 

(.62) 

'  • - o  j  k  +  ki  --18  --32  --24  --02  --27  --91  --27  --27  -1-47 

(.5)  (.45)  (.5)  (.29)  (.19)  (.31)  (.36)  (.48)  (.38) 

i% - *-#  J  k+ki  +  ki  +  Ph  --18  --10  -.24  .89  1.15  -.10  .32  -.17  -.98 

_ (.5)  (.55)  (.5)  (.71)  (.81)  (.69)  (.64)  (.52)  (.62) 

Q  corresponds  to  a  smoker,  A fy  corresponds  to  the  change  statistic  for  the  given  selection  case. 

TABLE  V:  Results  for  ego  selection  on  the  ceteris  paribus  basis  of  smoking  attributes.  k  =  k,  +  <51. ,  or  the  base  estimate 
plus  the  dummy  given  by  the  corresponding  column.  If  a  coefficient  was  not  included  in  a  particular  model,  it  is  fixed  to 
zero  and  calculated  thus.  Estimated  selection  probabilities  given  ego  i  s  smoking  status  are  given  in  parenthesis  below  each 

exp  (fi  ■  (x) ) 

estimate  (i.e.  (x)  =  - — - - - — )  where  k  is  selecting  an  actor  with  the  opposite  smoking  behavior  from  j; 

exp(/jj(x))  +  exp(/jfc(x)) 

see  Equation  (7)).  Note  that  the  conditional  probabilities  of  selection  for  ego  i  given  her  smoking  behavior  sum  to  one. 


